Probabilistic framework for solving visual dialog
نویسندگان
چکیده
In this paper, we propose a probabilistic framework for solving the task of ‘Visual Dialog’. Solving requires reasoning and understanding visual modality, language common sense knowledge to answer. Various architectures have been proposed solve by variants multi-modal deep learning techniques that combine representations. However, believe it is crucial understand analyze sources uncertainty task. Our approach allows estimating also aids diverse generation answers. The obtained through representation module provides us with representations image, question conversation history, ensures latent candidate answers are given an chooses appropriate answer minimizes uncertainty. We thoroughly evaluate model detailed ablation analysis, comparison state art visualization in method. Using framework, thus obtain improved dialog system more explainable.
منابع مشابه
A Frame-Based Probabilistic Framework for Spoken Dialog Management Using Dialog Examples
This paper proposes a probabilistic framework for spoken dialog management using dialog examples. To overcome the complexity problems of the classic partially observable Markov decision processes (POMDPs) based dialog manager, we use a frame-based belief state representation that reduces the complexity of belief update. We also used dialog examples to maintain a reasonable number of system acti...
متن کاملProbabilistic Dialog Management
Modeling user interfaces as dialogs provides a conceptual framework to address global coherence and efficiency of interactions. While non-probabilistic approaches provide convincing results and transparent dialog behavior, probabilistic techniques can help to account for inherent uncertainties in user input. In this paper, we present three patterns for probabilistic dialog management or support...
متن کاملWizArg: Visual Argumentation Framework Solving Wizard
Extension-based argumentation semantics have shown to be a suitable approach for performing practical reasoning. An important concern in extensionbased-argumentation semantics is the computational complexity of the decision problems that has been shown to range from NP-complete to Π 2 -complete. In this paper, we introduce a generic extension-based argumentation semantics solver, that is called...
متن کاملInteractive Visual Dialog
In this paper we propose a paradigm called the Interactive Visual Dialog (IVD) as a means of facilitating a system’s ability to recognize objects presented to it by a human. The presentation centers around a supermarket checkout scenario in which an operator presents an item to be tallied to a stationary television camera. An active vision approach is used to provide feedback to the operator in...
متن کاملCoDraw: Visual Dialog for Collaborative Drawing
In this work, we propose a goal-driven collaborative task that contains vision, language, and action in a virtual environment as its core components. Specifically, we develop a collaborative ‘Image Drawing’ game between two agents, called CoDraw. Our game is grounded in a virtual world that contains movable clip art objects. Two players, Teller and Drawer, are involved. The Teller sees an abstr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Pattern Recognition
سال: 2021
ISSN: ['1873-5142', '0031-3203']
DOI: https://doi.org/10.1016/j.patcog.2020.107586